THE CLIQUE DENSITY THEOREM. 



(N 
O 



o 
u 



CHRISTIAN REIHER 

Abstract. Consider the following question: Fix an integer r ^ 3 as well as a 
real parameter 7 e [0, 1/2) and let n be a large integer. How many cliques of 
size r must a graph on n vertices with at least 771^ edges necessarily contain? 
As n tends to infinity, the answer turns out to be of the form 



O' F,(7) •n'- + OK-2) 

D ■ 

, for some constant Fr{'j) the precise value of which has been conjectured by 

LovASZ and Simonovits in the 1970s. Despite of some impressive progress 
by BOLLOBAS and these two authors themselves, their conjecture has remained 
widely open until very recently when Razborov and Nikiforov managed to 
solve the cases r — 3 and r — A, respectively. In the present paper we prove 
this Clique Density Conjecture inductively for all values of r. An interesting 



. corollary to this result asserts that the graphs that are extremal with respect 



to this problem do not depend much on r - a fact that can be made precise by 
talking about graphs with weighted vertices. 



1. Introduction. 



> 

in 

CN ■ Given an integer r ^ 3 and a graph Q with n vertices and 'jn'^ edges, where 

^ ' ^ 7 < ^, how many chques of size r must Q necessarily have? This question 

CN ' presents itself almost naturally once one knows about Turan's classical theorem 

which informs us in particular of the mere existence of such cliques as soon as 
^ , 7 becomes larger than 2{^-i) ^^^^ ^^^^ bound is indeed sharp. In awareness of 

■ the structure of the unique graphs which are extremal with respect to the absence 

" ■ ■ of r-cliques, namely complete (r — l)-partite graphs the sizes of whose vertex 

classes are as close to one another as possible, it is quite tempting to propose the 
following construction for guessing the answer to the more general question we are 
just discussing: Form for some positive integer s a complete [s + l)-partite graph 
all of whose vertex classes except for one perhaps substantially smaller one are 
of equal size in such a way that the number of its edges is about 7n^ and count 
the number of r-cliques the graph thereby obtained possesses. Speaking rather 
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roughly, ignoring for a moment the phenomenon that not all real numbers are 
integers and interpreting the word "about" in the foregoing sentence as meaning 
"precisely" , this can be accomplished in exactly one way: Namely, one determines 

, takes a G [O,^] to be a solution of 
7 = 2{J+i) (-*-~'^^) arranges the s larger vertex classes of the multipartite graph 
one intends to exhibit to consist of exactly "^^^1"'' vertices, whereas the remaining 
one comprises just vertices. The resulting graph is easily computed to 

contain 

(^('^')(l + «r^(l-(r-l)a)-n^ 

cliques of size r. This calculation motivates the clique Chque Density Conjecture 
proposed by LovASZ and Simonovits (see [6]), which is the main result of this 
paper, an that reads as follows: 

Conjecture 1.1. //r ^ 3 and 7 G [O, |), then every graph on n vertices with at 
least 7n^ edges contains at least 

(^('^')(l + «r^(l-(r-l)a)-n^ 

cliques of size r, where s 1 is an integer for which 7 G [^j^, 2{i+T)] '^^'^ ^ t^' s] 
is implicitly defined by •y = 2{s+i) ~ '^^')- 

Before we start to prove this, a few historical and mathematical remarks are in 
order and to make the former ones more perspicuous we start with the latter ones. 

First one observes that if s ^ r — 2, then the binomial coefficient {^^^) vanishes, 
which means that in accordance with Turan's result no r-cliques are guaranteed 
to exist whenever 7 ^ 2{r'-i) ■ soon as 7 passes this threshold value by an 
absolute positive amount however tiny, the minimal number of r-cliques starts to 
become proportional to n'^. Second, if 7 happens to be of the form 2{t+i) some 
positive integer t, then there are two legitimate choices for the pair (s, a), namely 
(t, 0) and {t + 1, j^)- Yet it is not hard to verify that both of these give rise to 
the same lower bound of 

1 ft + l 

{t + lY\ r 

on the number of r-cliques. Incidentally, as we shall see below, these boundary 
cases are the easiest to deal with. Third, it should be clear in the light of the con- 
siderations sketched above that this claim, if true, is asymptotically best possible 
when we think of r and 7 as being fixed and of n as tending to infinity. In other 



an integer s ^ 1 for which 7 G 



s— 1 s 
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words, the bizarre expression appearing in the above formula before the multiph- 
cation dot is insofar "correct" as it is the best possible one not depending on n. 
On the much more ambitious question where one also regards n as being fixed 
and asks for the minimal number of r-cliques in an absolute sense, we will not 
say anything intelligent in this paper going beyond Conjecture 11.11 It neverthe- 
less appears worth while to mention at this point an observation apparently due 
to NiKlFOROV ([7J) without giving further details here: If one repeats the above 
calculation more carefully, i.e. using real integers instead of their approximations, 
one may see that for fixed values of 7 and r the error inherent in the above formula 
is of a lower order than one would perhaps guess at first, i.e. it is not only 0{rf~^), 
but even 0{rf^'^). 

We now turn to some historical comments: The apart from TURAN's work first 
attempt to bound the number kr{Q) of r-Cliques contained in a graph Q from 
below was undertaken by Goodman ([5J) who proved k^{Q) ^ §7(47 — l)'n? 
whenever Q has n vertices and at least 7n^ edges. This was extended by LoVASZ 
and SiMONOVlTS ([6]) to 

K{G) > \ ■ 27(47 - 1)(67 - 2) ■ . . . ■ (2(r - 1)7 - (r - 2)) ■ 

for all r ^ 3 and 7 ^ 2{r~-i) ■ might be helpful to observe that this lower bound 
agrees with Conjecture [TTT] whenever 7 = 2(t+i) "^^ere t ^ r — 2 denotes an integer, 
and that it is piecewise convex as a function of 7 between these values. The first 
and in this generality as far as we know hitherto never defeated improvement over 
this result was obtained by Bollobas (pQ, see also section 1 of chapter VI from 
[2]), who devised a miraculous argument demonstrating that the piecewise linear 
function interpolating between these boundary values also serves as a lower bound 
of kr{Q). The first person who almost achieved a whole continuum of cases of 
Conjecture [LT] was Fisher (|3]) who attacked the case r = 3 and s = 2 by means 
of ideas belonging to the spectral theory of graphs and relying on a then unproved 
hypothesis regarding the zeros of clique polynomials he managed to solve this case; 
the conjecture on which he relied has later been established by Goldwurm and 
Santini ([4j). An altogether different proof of this case has later been given by 
Razborov in the fifth section of [H] . The approach described there is of a highly 
infinitary nature and is based upon what one might call the "differential calculus 
of flag algebra homomorphisms" , which in turn constitutes an important part of 
Razborov's flag algebraic investigations that are designed to provide us with a 
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framework for thinking about problems from extremal combinatorics in general 
in a much more systematic fashion than it has ever been accomplished before. 
Shortly afterwards, Razborov dedicated a whole paper to the application of his 
calculus to the minimal triangle density of graphs with given edge density in which 
he entirely proved the case r = 3 of Conjecture 11.11 His argument proceeds by 
induction on s. Upon seeing it, NiKlFOROV realized that one could get finitary 
analogues of some of Razborov 's equations in a setting similar to the one utilized 
by MOTZKIN and Strauss in their alternative proof of Turan's theorem 
by exploiting Lagrange multipliers and this insight allowed him to give another 
proof of this result in a more structural spirit and dealing with all values of s at 
the same time. With some modifications, this proof applies to the case r = 4 as 
well ([7]). Given these quite rapid recent developments, we may think of ourselves 
as "standing on the shoulders of giants", as Sir Isaac Newton would express 
it. Slightly less informally, we shall try to explain below how to emulate still 
more substantial fragments of Razborov 's flag algebraic differential calculus in 
the finitary analytical setting used by NiKlFOROV and then perform a technically 
more elaborate version of Razborov 's original argument on triangles. 

It should be pointed out at this occasion, however, that the preference we have 
given to a rather concrete presentation over entirely abstract manipulations ex- 
clusively serves expository purposes and is in no way enforced by the structure of 
our argument itself. On the contrary, the reader who has seriously studied [S] and 
[TU] will have no difficulty whatsoever in recasting the relevant passages of this 
paper in terms of flag algebra homomorphisms. 

2. Weighted Graphs. 

Given a set X and a positive integer r, we use X^'^^ to denote the collection of 
all r-element subsets of X. Also, if n refers to a positive integer, then [n] is, by 
definition, shorthand for {1,2, ... ,n}. By a weighted graph of order n, we mean 
a pair consisting of a sequence (xi,X2, . . . ,x„) of non-negative real numbers the 
sum of which is equal to 1 and a function A : [n]*^^-* — > [0, 1]. Whenever Q is such 
a weighted graph of order n and r is a positive integer, we set 

Notice for instance that 



G{Ki) = Xi + X2 + . . . + Xn = 1- 



THE CLIQUE DENSITY THEOREM. 5 

The following variant of Conjecture 11.11 is. as we are soon going to see, equivalent 
to it and has to the best of our knowledge first been formulated explicitly by 
NiKIFOROV. 

Claim 2.1. Let r ^ 3 denote an integer and Q a weighted graph. Suppose that 
a positive integer s and a real number a G [O, -^j are chosen in such a way that 

^(^2) = 2(iTT)(l-«')- ^^^^ 

Q^Kr) ^ + + ay-^ (1 - (r - l)a) . 



To see that this indeed entails Conjecture 11.11 take a graph Q and an integer 
r ^ 3 about which you want to know the latter statement, label the vertices of Q 
arbitrarily as {fi, f2, . . . , fn} and construct a weighted graph Q' of order n by the 
stipulations xi = X2 = ■ ■ ■ = Xn = - and 



if Vi and Vj are joined in Q by an edge, 
otherwise 



for all {i, j} G [ri]^^^. Plainly Q has exactly ^'(^2) -n^ edges and Q'{Kr) -n^ cliques 
of size r, which proves the desired implication. 

As we shall not need the converse direction, we only give a sketch of its proof. 
Given a weighted graph of order n specified by the sequence (xi,a;2, . . . ,a;„) of 
real numbers as well as a function A : [n]^^) — > [0, 1], take a very large integer k 
and form a graph % whose vertices fall into n independent classes Vi, V2, . . . , 
whose sizes are approximately /cxi, kx2, ■ ■ ■ , kxn, respectively and in which for each 
{i,j} G [n]^'^^ roughly a proportion of A{{i,j}) among all possible edges from Vi to 
Vj is present in a sufficiently quasirandom way. Such a graph T-L can in particular 
be arranged to have k vertices, ^(-ft'2) ■ k"^ ± 0{k) edges and Q{Kr) ■ k^ ± 0{k^~^) 
cliques of size r, so letting k tend to infinity we might in fact derive Claim 12.11 
from Conjecture 11.11 

Throughout the rest of this paper, we follow NiKIFOROV 's suggestion to think 
about the clique density problem in terms of weighted graphs, as this allows us to 
apply our knowledge concerning continuous and piecewise differentiable or convex 
functions rather directly without necessitating the usage of fiag algebraic circum- 
locutions. In other words, among all possible limits of graphs one could refer to 
in terms of fiag algebra homomorphisms we only exploit a few somewhat special 
ones that are rather easily visualized. 
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3. The bound obtained by Lovasz and Simonovits. 

We are now going to provide an analogue of the inequality due to LoVASZ and 
Simonovits formulated in the language of weighted graphs. Although it is clearly 
weaker then Claim 12.11 and thus soon going to be superseded, we are nevertheless 
needing it for two purposes. First, it gives an optimal result if 7 = ^ for some 
positive integer t and our later arguments do not apply to that case - basically 
because the function by means of which Q{Kr) is going to be estimated in terms 
of 7 is not differentiable at those values. Second, it will tell us that if we choose 
a counterexample to Claim 12.11 that is extremal in a sense on which we shall 
elaborate below, then it still cannot be wicked to such an extent that the analytical 
estimates we intend to apply later on are precluded from leading to the envisaged 
contradiction - despite the fact that they are in some sense only "locally true" . 

Clearly the easiest way to obtain the inequality stated as Proposition 13.11 below 
would have been to accept the result of LovASZ and Simonovits as a matter 
of fact and then to argue that the proposition itself follows from it by using the 
same construction we have indicated at the end of the previous section. Yet there 
is a reason which makes us thinking that it might be useful to include the argu- 
ment presented below into this paper. Namely in section 5 the reader is going to 
encounter a very similar but technically somewhat more demanding computation. 
So what we are doing here might be regarded as a helpful preparation for other 
things that follow. 

Proposition 3.1. Given a weighted graph Q and an integer r ^ 2 such that the 
quantity 7 = ^(^2) is not smaller than 2IT-1) > have 

GiKr) > ^ ■ 27(47 - 1)(67 - 2) ■ . . . ■ (2(r - 1)7 - (r - 2)) . 
r\ 

Proof. Notice that this follows by means of an easy induction on r from the fol- 
lowing statement: 

(*) // a weighted graph Q satisfies 7 = ^ 2(r~~i) f^''" ■^'^'^^ r ^ 2, 

then 

2(r — D'y — (r — 2) 

g{Kr) > ^ ^ ^ ■ GiKr-i) and g{Kr-i) > 0. 

r 

For this reason, it suffices to verify (*) instead, which will again be done by 
induction on r. The case r = 2 of (*) is obvious in view of g{Ki) = 1. So suppose 
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now that ^ is a weighted graph satisfying 7 = Q{K2) ^ ^ > 2^F^ some 
r ^ 2 and 

r • ^ (2(r - 1)7 - (r - 2)) • g{Kr-i) as well as Q{Kr-i) > 0. 

For the induction step, we cursorily remark that these assumptions trivially entail 
Q{Kr) > so that it only remains to estimate Q{Kr+i) from below in terms of 
Q{Kr). Let n denote the order of Q and suppose that Q is given by the sequence 
(xi, X2, . . . , Xn) of reals as well as the function A : [n]^^) — > [0, 1]. 

For each M C [n] we write 

Now consider any M e [n]^''"'"^) and define 

Yi n ^(^)' 

BeM(2) FeM(2)-{£;} 

as well as 

Cm = ^7v- 

ATeMW 

We claim that these expressions satisfy 
2Bm - Cm ^ (r^ - 1)Am. 



To see this, we note that our inequality is linear in each of its variables A{E), where 
E G M^'^\ which entails that we only need to look at the case where A{E) E {0, 1} 
for all E e M(2). Now if additionally K = #{E e M^^) | A{E) = 0} is at least 2, 
then Am = Bm = and Cm ^ 0; if i^T = 1 then Am = 0, Bm = 1, Cm = 2, and 
finally if K = then Am = 1, Bm = |r(r + 1) and Cm = r + 1. This completes 
the proof of (®). 

Multiplying our estimate by Xm and summing over all possibilities for M, we infer 
(®) Y: C^Bm - Cm)Xm < (r^ - l)G(Kr+i) . 

Me[n](''+i) 

Setting 

for all L e we now investigate the sum 

E AlX,vI. 

Le[n]('-i) 
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Expanding the squares, we get several purely quadratic terms whose sum may in 
view of the inequality A^E)"^ ^ A{E), that is valid for all E E [n\^'^\ be bounded 
from above by 

Qe[n]M ie[n]-Q Me[n](^+^) 

Moreover, we get several mixed terms that sum up to 

2 • BmXm- 

Combining this with (®), we get 

^ ALXLril^g{Kr)+ {2Bm-Cm)Xm <g{Kr) + {r^-l)g{Kr+i). 

L6[n]('-i) Me[n]('-+i) 

Substituting this together with 

Le[n]('-i) QG[n]W 

and 

Le[n]('-i) 

into the inequality 

( Yl ^lXlVl"] < Y Yl "^lXlvI, 

\Le[n]('-i) / Le[n]('-i) Le[n]('-i) 

which is an instance of the CAUCHY-SCHWARZ-Inequahty, we deduce 

r'giKrf ^ g{Kr.i) {g{Kr) + (r^ - i)g{Kr+,)) . 

Invoking now the induction hypothesis, we obtain, after a permissable cancelation 

oig{Kr^i), 

(2(r - l)r7 - r(r - 2)) g{Kr) ^ g{Kr) + (r^ - l)g{Kr+i) 
and hence indeed 

(2r7 - (r - 1)) ^ (r + 

which completes the induction step. This finally proves (*) and thus the proposi- 
tion. □ 

Remcirk 3.2. The reader may find it both instructive and amusing to rewrite this 
argument in terms of flag algebra homomorphisms. 



THE CLIQUE DENSITY THEOREM. 9 

We would now like to make those consequences of Proposition 13 . II explicit that we 
shall really utilize in the sequel. 



Corollary 3.3. Suppose that r and s are integers satisfying r ^ 2 and s ^ r — 1. 

s-l 
2s 



Then for every weighted graph Q satisfying 7 = Q{K2) > one has 



Q{Kr) > 



1 fs\ ( 27 ^ '■"^ 



s \r I \ s — \ 



Proof. Clearly ^ ^ 2{r\) ' wherefore Proposition 13.11 tells us 

g{Kr) ^ ^ ■ 27(47 - 1)(67 - 2) ■ . . . ■ (2(r - 1)7 - (r - 2)) . 
r\ 

Now for each iG{l,2, ...,r — l}we have 

f sii-l)\ 27(5 -i) 

2^7 - i - 1 > 27 i - ^ = ^ 

\ s — 1 J s — 1 

and hence 



27 V ^ (s - 1) • . . . ■ (s - r + 1) 1 fs\ / 27 ^ ' ^ 



r\ ^ s ^ 

□ 

Corollary 3.4. Under the additional assumption a G {O, j}, Claim IKT\ holds. 

Proof. As we have already seen in the introduction, we have in these cases 7 = 
for some non-negative integer t. If t ^ r — 2 our claim is obvious, which 
means that we may suppose t ^ r — 1 from now on. Thus 7 is large enough for 
Proposition 13.11 to be applicable and the desired result follows. □ 

4. Some analytical preparations. 



Now we intend to provide a thorough analysis of the function occurring in Claim [2?n 
All of the observations we are going to make are rather easily established, but the 
amount of calculation on which some of them rely is not negligible. So in order 
to help the main proof of this paper to appear a bit shorter than it actually is, it 
seems to be advantageous to gather these rather routine computations in a sepa- 
rate section, the present one. The lazy and trustful reader who just wants to know 
how the clique density problem may be solved without digesting every mundane 
detail can safely take all the statements he finds here on belief and ignore their 
occasionally laborious verifications. We would also like to precaution the reader 
that we did not even attempt to motivate what is going to happen in the following 
pages. It might therefore be a sensible idea to interrupt the reading of this section 
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once the function H makes its first appearance. Thereafter one may directly jump 
to the next and hopefully more interesting section large portions of which should 
be fairly understandable then and whose remaining places should offer an intrinsic 
explanation of why certain properties of Fj. and related entities to be found here 
are worth our while to know about. 

Throughout this section, we fix integers r ^ 3 and s ^ r — 1 as well as a real 
number M ^ 1 satisfying 

r-2 



S - 1 



> 



r + 1 



M 



r-2 



Define the function Fr : [O, |) — > [O, ^) as follows: Given 7 G [O, |), choose the 

, determine the real number 



unique positive integer t for which 7 G 
a G (0, solving the equation 7 = 2{t+i) 



t-1 t 
2t ' 2(i+l) 



a 



and set 



Da) 



{t + 1)' V r 

In terms of this function, the statement of Claim 12.11 can be shortened to the 
inequality Q{Kr) ^ Fr{Q{K2)), that is allegedly valid for all weighted graphs 
Q. We have more or less already seen earlier that F^ is continuous and clearly 
it is piecewise differentiable as well. Moreover, F^. is identically vanishing on 



the interval 



r-2 
' 2(r-l) 



and if 7 G 2{t+i) ) some integer t ^ r — 1, then 



differentiating the equation locally defining -Fr(7) with respect to a, we infer 
ta , (r - l)r ft + 1' 



t + l 

As a > 0, it follows that 



{t + ly 

(r-l)r ft + l 
t{t + 1)^-1 V ^ 



a(l + a) 



r-2 



:i + ay-^ > 0. 



Thus Fr is strictly increasing on the interval 



r-2 



lim F,(7) 
7^1/2 



lim 

t — >-oo 



2{r-l) ' 2 
t + l 



{t + iy 



and as 
_ 1 

7*! 



it possesses an inverse F^ ^ : [O, ^) - 
sion for -^^(7) decreases as a decreases, w^ 



r-2 



. Moreover, the above expres- 



2{r-l) ' 2^ 

whence F^. is in an obvious sense piecewise 
concave. Note that the identity function F2 on [O, |) has essentially the same 
properties as F^. This concludes our discussion of the most elementary properties 
of these functions. 
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Next we propose to look at the function H : ■ M, M] — > given by 

Claim 4.1. The function H is strictly increasing and satisfies 
Fr-i ( Trf^l < H{M) ^ " ^ 



^2(s- 1) J ' ' " "^"^ V 2s 
Proof. If 77 e • M, M) , then 



which entails the first part of our claim. Furthermore, by our smallness condition 
imposed on M, we have 



gr-i \r-lj M'-2 s{s - ly-^ \r -Ij (s - ly-^ \r - 1 
i.e. 

H{M) > Fr-i 
Finally, using M ^ 1, we get 



s - 2 
2(^-1) 



□ 



Notice in particular that the composed function o H is defined everywhere 
on the interval ■ M, M] and that its range is contained in 



r-3 s-1 



2(r-2)^ 2s J'^hich 

in turn lies in [O, |). For t E {r — 2, r — 1, . . . , s — 1} there exists a unique real 
number 'dt G ■ M, Af] satisfying Hi^t) = i^r-i (^)- The number 'ds-i will 
play a special role later and sometimes it will just be denoted by i9. Evidently one 
has 

Later on we shall need some rudimentary knowledge concerning the magnitude of 
these numbers. 

Claim 4.2. // the integer t belongs to the interval [r — 2, s — 2], then -dt ^ ■ M . 
In addition, we have 'd ^ — • M . 
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Proof. Whenever t belongs to the specified interval, we have ^ H{M), 

i.e. 

t+l\ 1 f s \ 1 



rr-2 



(t + 1)^-1 \r-lj s^-^ \r-lj M 
Multiplying this by - r + 2)(i + If-'^/f-'^ we infer H{§t) ^ H{-^ ■ M), thus 
proving the first part of our claim. Similarly but slightly easier we infer from 
M ^ 1 that ■ M) ^H{'d), which leads to the second part of the claim. □ 

Claim 4.3. If r] e ■ M, M] and v = {F^\ o H){r]), then the function Q : 
[0, u] — y M defined by 

5 ^ (r - 1) (^^ ^ 5 - s'-'r]'-^Fr{5) 

attains its global maximum at 5 = u. 

Proof. Choose an integer t e [r — 2, s — 1] as well as a real number /3 e [0, |] such 
that 1/ — 2{t+i) (-*- ~/^^)- Since Q is piecewise convex and as convex functions attain 
their global maxima at boundary values, it suffices to establish the following two 
statements: 

(A) The function Q is increasing on u] . 

(B) Iide[t-l],thenQ{'-=^)^QC-=l). 

For the proofs of both of these subclaims, we use 

which is an obvious consequence of H{dt) ^ H{M). Now, to verify (A), take any 
6 G {^,1^) and write 6 = 2(t+i) ~ '^^)' '^here a G |). Multiplying (C) by 
(r — l)s^~^ one obtains 

IV ft + i\ ft + iy~^ , s 



t{t+iy-^\ r J \ t J ^ \r-l 
In view of 77 ^ M and a ^ | this implies 

^ (r - 1) 



whence Q'{S) ^ 0. Thereby we have proved assertion (^4). 

Let us now turn our attention to (B). If t ^ r — 1, then Fr vanishes at all relevant 
numbers and our claim is obvious. So henceforth we may suppose t ^ r and for 
similar reasons (i ^ r — 1 as well. The function $ : [7, ^] — > [0, 1] defined by 

x\ — x){l - 2x) • . . . • (1 - (r - l)x) 
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is obviously convex, wherefore 

* (i) - * (a 



d t 

Since 



it follows that 



1 ft\ lfd\^fl l\ (r-l)r 1 ft 



V\rJ dr\rj \d t J 2{t - r + 1) V-^\r, 

ft-l d-l\ I f t 

= (r - 1) 

Multiplying this by 



2t 2d / Vr - 1 



which is a consequence of (C) as ^ M, we deduce 



1 /t\ 1 /rfM , J s \ f t - I d-l 



r\rj d'yrj ] ^ '\r -IJ \ 2t 2d 
which is easily seen to be equivalent to {B). □ 

The following will only be used for A; = 2 and k — r, but the general case is not 
really harder. 

Claim 4.4. For each integer k ^ 2 the function J : • M, M] — > R defined by 

r^^^\F,oF;_\oH){r^) 

is concave or convex on the intervals [dt, 'dt+i], where t = r — 2,r — l,...,s — 2, 
as well as [d, M] depending on whether k ^ r — 1 or k ^ r — 1. 

Proof. Treating both cases for k at the same time, we select any t e {r — 2, r — 
1, . . . , s — 1} and intend to verify that the second derivative of J has the expected 
sign on where for convenience ■j?^ = M. Utilizing that x i — > is 

strictly increasing on (0,1) we may define a function S : {'&t:'&t+i) — > (j^A) 
such that 

(^-^°^)(^)=2(^--S(;^ 
holds for all r] e (i^t, i^t+i). Since the right hand side may be rewritten as 



2(( + l) vsw 
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we have 



s \ {r - l)ri - {r -2)M 1 ft + A (r - l)S{ri) - (r - 2) 



Differentiating with respect to rj and dividing by (r — 2)(r — 1), we find 
s \ M-T] _ 1 ft+l\ 1-S{r]) 



and the combination of both equations yields 

^ SjyyM-r^) [(r-l)Siv)-ir-2)] 
^ r/(l - S{r])) [{r - l)r] - (r - 2)M] ' 

Furthermore 



(t + l)'=V k J S{r]y 
Differentiating and using the above formula for S'{r]), we get 

1 A + A kri''-mr-l)S{ri)ri-{k-l)r]+{k-r + l)S{ri)M] 
~ {t + iy \ k ) S{r]y [(r - l)ri - (r - 2)M] ' 

A repetition of that argument leads to 

1 [t + l\ k{k-l){r - k-l)r]''-^[S{7])M -7]f 



(t + l)'=V k J S (riy {1 - S iri))[{r- 1)7] ~{r- 2) Mf' 
which entails the desired conclusion in view of the presence of the factor r — k — 1 
in the numerator. □ 



Claim 4.5. For each t] e ■ M, M] the dzfft 



erence 



(r-l)( '\ jr^^zz-s'^'V^rl^^) 



is at most 

r -2 fs + 1 



(^^^^ ■ iW - 1)^^^ - " l)s^M + r{s - l)Mr]) 



(s-l)(s + l)V r 
where v = o H) (rj) . 

Proof. The difference under consideration, which we shall denote by Tij]) in the 
sequel, is piecewise convex by Claim 14.41 Using the definition of d it is not hard 
to see that for rj = d one has equality in the inequality we seek to establish and 
for these reasons it suffices to prove the statements 

{A) Ifr-2^t^s-2,then lim T'(r/) ^ (r - 2) 



and 



(B) lim r{r])^{r-2){;_^)M, 

r) — 
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where the superscripted plus or minus signs below the limit are intended to signify 
that 1] is supposed to approach the boundary value in question from the right or 
from the left, respectively. Throughout the computations that follow we use the 
function S as well as the formulae for J'{ri) corresponding to k = 2 and k = r 
from the foregoing proof. 

To verify (A), we divide into two cases. 
First Case: t = r — 2 

Note that the hypothesis of (A) yields s ^ r, wherefore if(^9r-i) ^ H{M), i.e. 

1 1 



Now let rj G (i?r-2? ''^r-i) be arbitrary. Then 

1 {r-l)S{ri)-{r-2) ^ 1 f s \ {r - l)S{r]) - {r - 2) 



which by the definition of S and H yields H{ri) ^ H{S{r])M), and thus rj ^ 
S{7])M. Also, 7] G ■ M,M) gives 

7]^ {r -l)ri - {r - 2)M > 0, 

which by S{ri) ^ 1 may be weakened to 

7] ^ S{ri) ((r - l)ri - (r - 2)M) . 

Consequently 

T] {S{r])M -T])^ S{7]) {S{r])M - r]) ((r - l)r] - (r - 2)M) , 

i.e. 

r] [{r - l)S{7])r] - r] - {r - 3)S{r])M] ^ MS{7]f [{r - l)r] - (r - 2)M] . 
Muhiplying by (r - 2){^,^^)S{r])-'^ [(r - l)r] - (r - 2)Af]"^ we infer 

as required. 

Second Case: r — l^t^s — 2. 
Notice that Claim 14.21 entails 

- 2){M - ^t)iM - '-f^t) > 0, 



(r - m ■ ' Z ~^ ~Z -(M- ^^*) ^ (r - 2)M. 
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whence 

i9t [(r - 2 - - (r - 3)M] ^ (M - ]i9t) {{r - l)^t - (r - 2)M) . 

By t > r — 2 the second factor of the right hand side is positive, wherefore 

(r - 2 - -ir- 3)M 
^* (r-l)^,-(r-2)M 

Multiplying by (r — 1) and subtracting M — ^i?* we obtain 

(r - 2 - - (r - 3)M 
(r - l)^t - (r - 2)M 

If we now multiply by (^1^) and use 

V — >^t t + L 

as well as the definition of ■j?^, we get indeed 

lim T'(ri) ^ (r - 2) ( ^ |M. 
This completes the verification of (A). 

So let us now continue with {B). From H{M) > F^-i ^ 2(s-^i) ) '-'^^ deduces easily 

S{M) > ^ ^ 

s r — 1 

where we have made the obvious definition 

S{M) = lim S{r]). 

T] — >M- 

This entails 

{s-r + l)S{M) 
(r - 1)S{M) - (r - 2) ^ ' 

which in turn implies 

1 (r - 1)S(A/) - (r - 2) ^'K S(Af) j 

Adding (r — 2)s and rearranging our terms, we infer 

(r i) ^^(^)-^ {s-r + l){rS{M)-{r-l)) 

^ ' S{Mf S{M){{r-l)S{M)-{r-2))^^ ' 

Multiplying this by ^ • (^1^) and exploiting the equation 

1 _ (r - l)S{M) - (r - 2) 
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that follows easily from the definition of S, one gets assertion (B), whereby 



Claim 14.51 has finally been proved. □ 

5. Clique densities. 
We now come to the unique central section of this paper in which we are going to 



provide a proof of Claim 12. ![ thus solving the clique density problem. 

Theorem 5.1. Whenever Q denotes a weighted graph and r ^ 2 an integer, we 
have g{Kr) ^ Fr{Q{K2)) . To say the same thing more verbosely, the estimate 

1 ^s + l\ ,^ _i 



holds, where s refers to a positive integer for which 7 = Q{K2) belongs to the 



interval 



2s ' 2(s+l) 



and a E [O, is required to satisfy 7 



2(s+l) V 

Proof. Since F2 is the identity function confined to [O, |), this is clear for r = 2. 
Arguing indirectly, let r ^ 3 denote the least integer for which this can fail] and 
take n to be the least order that counterexamples can possibly have. As we have 
already mentioned earlier, the function Fr : [O, |) — > [O, is continuous. Now 
the totality of all weighted graphs of order n may in an obvious fashion be regarded 
as a compact topological space, which implies that the continuous function defined 
on it by 

g ^ GiKr) - Fr{g{K2)) 

attains an absolute minimum. Now fix once and for all a weighted graph Q of 
order n for which this minimal value occurs and choose an integer s ^ 1 as well 
as a real number a G [O, ^] such that the number 7 = Q{K2) can be written as 
7 = 2{s+i) (-*- ~ '^^)- hypothesized failure of our theorem, we have 

QiKr) < ■ ! ^ V (1 + (1 - - ' 



[s + ly V 

which clearly can only happen if s ^ r — 1. Also, Corollary 13.41 tells us that 
a G (0, -^j, wherefore the function F^. is in particular differentiable at 7. As we 
have seen in Section 4, its derivative A = -F,'(7) is given by 

(r-l)r (s + l\ 2 



*By the results of Razborov and Nikiforov ([IH], H) we could assume r ^ 5 here, but 
actually there is no need for doing so. 
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Let Q as usual be presented by the sequence (xi, X2-, • • • , of non-negative reals 
summing up to 1 as well as the function A : [n]*^^-* — )■ [0,1]. Having thereby 
explained how to select that potential counterexample towards whose investigation 
we will direct our whole efforts, we hope to render the still rather lengthy remainder 
of the proof more intelligible by dividing it into five steps. 

First Step: Exploiting extremality. 

Clearly each of the numbers Xi, 0:2, . . . , x„ has to be positive, for if one among 
them vanished there existed another counterexample of lower order than n. For 
{i, j} G [n]*^^) it is sometimes convenient to write Aij instead of A{{i^j}). Given a 
sequence ii,i2, ■ ■ ■ ,im oi distinct integers from [n] as well as another integer p ^ 1, 
we set 

where for typographical reasons we have temporarily written / instead of [n] — 
{ii,i2) • • • )^m}- Note that for m = this coincides with our earlier notation, so 
no confusion can arise. Since (xi, 0:2, . . . , Xn) is an interior point of the simplex 

{(6, 6, ■ ■ ■ , ^n) e [0, 1]" 1 6 + 6 + ■ ■ ■ + = 1}, 

Lagrange's Theorem concerning the extremal values of multivariate functions 
reveals the existence of a certain real constant such that 

holds for ail i G [n]. The impact that extremality has on A is studied in 
(*) For each {i, j} G [n]*^^) one has Aij{\ — Qij{Kr-2)) ^ 0. 

This is obvious whenever Aij vanishes, so let us suppose now that this number is 
positive. If 77 denotes any sufficiently small positive real number, we may construct 
a weighted graph Q'^ agreeing entirely with Q except for the stipulation A!lj — 
Aij—T). Clearly one has g^{K2) — ^y—rjXiXj and Q^{Kr) — Q{Kr)—r]XiXjQij{Kj.-2)i 
wherefore 

which in view of the assumed extremality of g entails A ^ gij{Kr-2) and hence (*). 
Second Step: A morally flag algebraic consideration. 
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As in the proof of Proposition \'d.l\ we set 

= n ^'^^^ = n 

for all M C [n]. This time, however, we need the further stipulations 

= ^ ( 5Z ^ij)^M-{i}, 

ieM jeM-{'i} 
Cm = ^ Aq 

and 

^^^ = Y1 ~ ~ Aik)AM-{i} 

iGM{j,fc}g(A./-{i}){2) 

for all M G [n]*^''^-^''. What we shall need to know about these expressions is: 
(®) IfMe then Bm - (r - 1)Cm + Dm > {r + 1)Am. 

To see this, we note again that our inequality is linear in each of its variables, for 
which reason we may suppose A{E) e {0, 1} for all E e M^^\ Form a graph n 
with vertex set M by putting an edge between i,jEM exactly if Aij = 1. If H 
is free from cliques of size r, then Am = Bm = Cm = Dm = 0. If "H contains a 
unique such clique and i further edges, where ^ i ^ r — 2, then Am = 0, Bm = i, 
Cm = 1 and Dm = (^O ' therefore indeed BM-{r-l)CM + DM = C'l'^) > Am- 
If the graph "H possesses exactly two cliques of size r, then it misses precisely one 
edge and Am = 0, Bm = 2(r — 1), Cm = 2 as well as Dm = 0. Finally, if "H 
happens to be a clique, then Am = 1, Bm = r{r + 1), Cm = r + 1 and Dm = 0. 
This analysis proves (®) in all possible cases. 

Multiplying the inequality just obtained by Xm and summing over M, we deduce 

(®) E (Bm - (r - 1)Cm + Dm)Xm >{r + l)g{Kr+i). 

A/e[n]('-+i) 

Next, we propose to ponder the sum 

ie[n] 

Expanding the product, we get a plethora of terms involving each of the variables 
xi,X2, ■ ■ ■ ,Xn at most linearly and their sum is easily seen to equal 

BmXm- 

MG[n] ('-+!) 
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The remaining "quadratic" terms sum up to 

{ij}e[n](2) 

Utilizing the inequahty (*) from our first step, this sum is estimated to be 
^ ^ {xlxj + Xix])AijQij{Kr-2) - A$, 

where for brevity 

{ij}e[n](2) 

The first of these sums may also be written as 

Qe[n]W 9e[n]-Q Me[n]('-+i) 

Thus we have altogether 
E xMKi)gi{Kr-i) >{r- l)G{Kr) + E (^M - (r - 1)Cm)Xm - A$. 

ie[n] Me[n]('-+l) 

Exploiting ((§)) the right hand side can further be bounded from below by 
{r-l)g{Kr) + {r + l)g{Kr^^)-\^- J] DmXm- 

Me[n]('-+i) 

Now notice that trivially 

Me[nl('-+i) ie[nl 0-,fe}e([n]-{i})(2) 

Hence writing 

^{i,j,k} = - - Ak) + - Aj){i - Ajk) + - Ak){i - Ak) 

for all {i,j,k} e [n]*^^) and applying (*) again, we obtain 

rG[n](3) ie[n] 

The Lagrange equations entail the right hand side to equal 

A Y ^MKif - 27A* 

ie[n] 
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and in view of the by now straightforward calculation 

ie[n] {ij}e[„](2) 

{i,i,fc}e[n](3) 

= + ^ XiXjAij (l- ^ Xfc) 

{i,j}6M(2) ke[n]-{i,j} 
{ij,fc}G[n](3) 

- yTXT+?>Q{K^) 

re[n](3) 

= J] VTXT + -t + ?>Q{K^) 

Te[n](3) 

we finally arrive at the main estimate of this step, namely 

(r - l)g{Kr) + (r + ^ A(7 + ?,g{K^)) - 27/x. 



r/izrc? Step: Introducing and estimating M . 
Let us now define a real number M such that 



{r -2)r fs + 1\ ^ 



Since 

r • Q{Kr) = Y XiQi{Kr-i) = Xi{Xgi{Ki) - n) = 27A - fi, 

ie[n] ie[n] 

we have 

g{Kr) = r ^ ~ ~ - (r - 2)(M - 1)] , 

which in view of the presumed smallness of the left hand side gives M > 1. 
Eventually we shall prove M ^ 1 as well, thereby reaching a final contradiction, 
but before we can realistically hope to do so we first need to provide a much weaker 
upper bound on M, so that the results from our fourth section become available. 
This is our next immediate task. To achieve it, we find it convenient to introduce 
the abbreviations 

A 



B 



(r-2)r + 1 



(s + l)^V 
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and 

Notice that the last of these stipulations is permissible, as Proposition 13.11 entails 
Q{Kr) > in view of 7 > ^ ^ 2{Y-i) ■ ^PP^yi^S inequality between the 
arithmetic and geometric mean of r — 1 positive real numbers, one of which is 
equal to r ■ Q{KY) while each of the remaining r — 2 numbers equals C(l + 

we get 

r ■ g{K,.) + (r - 2)C(1 + a)""! ^ 2(r - 1)^7(1 + aY'^ = 27A = r ■ g^K^) + /i 

and hence (r — 2)C ^ BM. Rising both sides to their (r — 2)'^'^ powers we infer 
after some easy simplifications 



si \ r 



Using now Corollary I3.3[ we obtain 



s J s — 1 

as desired. 

Fourth Step: Induction on n. 

By (*) we have for each i G [n] the inequality 

(r - = ^ XjAijgij{Kr-2) ^ ^ XxjAij = Xgi{Ki) 

je[n]-{i} je[n]-{i} 

and hence by the Lagrange equation 

{r-l){Xg,{K,)-fi)^Xgi{K,), 

i.e. 

Thus if we define the real number rji to obey ^j(-ft'i) = -^{^ + Oi)rji, then rji ^ M. 
Similarly but easier we have 

Q^gi{Kr-i) = xg,{Ki)-^i, 

whence gi{Ki) ^ ^, so that altogether we get r/j G ■ M, M] . 
The main objective of this step is the verification of 
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(ffl) For each i G [n] one has 

X (i(r - l)s9^ - (r - l)s^M + r(s - l)r7iM) . 



Plainly it suffices to show this for i = n and for brevity we are henceforth going to 
write T] instead of rjn- As Qn{Ki) is positive, we may construct a weighted graph 
Q* of order n — 1 specified by the numbers x* = q^^^[-^ for i = 1, 2, . . . , n — 1 as 
well as the restriction of ^ to \n — 1]'^^^ Our minimal choices of r and n entail 
Q\Kr^x) > Fr^iiS) and Q*{Kr) ^ where 6 = Q*{K2). By our construction 

of Q* and the Lagrange equation for i = n, we have 



r-lj 



Expressing the right hand side in terms of r, s, M and r/, this simplifies to 
Q*{Kr-i) = H{r]), where H refers to the function defined just before Claim HTTl 
Stipulating therefore u = F^\{H(r])), as in the hypothesis of Claim I^^Sj we have 
6 ^ u. Now in view of 

= |#4 and g*iKr) ^"^^'■^ 



we get 

By our bound on G*{Kr), the difference in curly braces is at most Q{S), where Q 
signifies the function introduced in Claim I4.3[ and by that claim itself this is in 
turn at most Qi^)- Estimating now the product "rfQiv) by means of Claim 1^31 
we finish proving (ffl). 

Fifth Step: An excellent finish. 
Notice that 

^(1 + a)Y, ^^Vr = Yl ^MKi) = 27 = ^(1 
s + 1 s + 1 

wherefore 

XiTii = 1 - a. 



a 



«6 n 
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Thus multiplying (ffl) by Xj and adding up the n resulting inequalities we conclude 

(|(r - _ _ ^ - 1)(1 - a)M) . 

Combining this with the main result from our second step and plugging in the 
formulae expressing 7, A, and Q{Kr) in terms of r, s and M from the first and 
third step we get an estimate that on first sight looks rather lengthy. After massive 
cancelations, however, it just reads 



(1 - a) - 2M ^ vl±i^(^2 _ 2M^). 
— 1 

Since ^ G [^M,M] by Claim |421 we have 

- 2M^ = {M- -M^ ^ ■ M^, 

whence 

(l-a)-2M^-(l + a)M^ 

i.e. 

(1 - M)((l -«)-(! + a)M) ^ 0. 

But if M really was greater than 1, as suggested by our third step, then both factors 
of the left hand side were negative. This contradiction finally proves Theorem 15. 1[ 

□ 
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